AD  A  12  3  9  5  3 


ESI)  TR-82-40.T  Vol.  I 


MTR-8278,  Vol.  I 


TRANSFORM  DECODINOOl  REEI)  SOEOMON  CODES  VOLUME  I: 
ALGORITHM  ANDSlGNAi  PROCESSING  STRI  CT!.  RE 


B\ 

l>  O.CARHOl  \ 
B  I..  JOHNSON 
S  I  Ml  I. HAN 

NOVEMBER  1982 


Prepared  lor 

SOLID  STAT tl  SC  1 11N.C  l.S  DIV  ISION 
ROM  H  A I R  I) E VETO P\1  E N  I  CUN  I  E  R 
LN1TED STATES  AIR  FORCE 

Hanscom  Air  Force  Base.  Massachusetts 


DT!C 


SELECTE 

JAN  3  1  1983 


B 


>- 

Q_ 

O 

CJ) 

UJ 


Appi  * IV i’ll  l«  M  public  I  c’li'iisc*.  i 
ilisii  ibuiion  unlmnuil  ' 


Project  No.  70107170 
Prepared  bv 

THE  MITRE  CORPORATION 
Bed  lord,  Massachusetts 

Contract  No.  F19628-81-C-0001 


\\  I  KM  I  l  s  (  I.  *\ei  llllli -Ml  ill  .1  \\  MIL**'.  speiMK.i 
;  llOtlN.  ill  III  ill1 -I  li.lt  .1  .IH'  Uni'll  lol  •<  n\  |>tl/|'ll'-l 

ill. hi  .1  tlel  mill  lx  i  el.  iU  il  e**\ei  1 1 1  *  n 'in  pi" 
i  m  emeni  *»|vi  .i U» »u  1  Ik  i’i ix i  i  i mu  ni  i hei  i  h\  in 

.  tills  III!  t  1'<.|>I  lllsiht  1 1  (  \  Hoi  .MIX  i»I>I  li!.l  t  toll 

x\  Ii.Hsihxii  .  .iiul  lln  I. ui  ili.il  ilie  i-'i'Xii  iniiiiii 
ni.ix  ii.ixi  Itn  imil. iu ii.  t hi  iiinIk-iI  *>i  in  .m\  \\.i\ 

i  supplied  tllv'  s.ltll  lll.lXUUL'N  •'(Hi  il  u  .il  l»  'Ms  or 
i  olhei  il.n.i  is  iioi  to  l»i  1 1  >.mi  vli  il  l»x  iiiij»l»i  .ili*)ii 
I  i*i  i*l  Iki  xx  im\  .is  1 1 1  .nix  ui.iiiiK'i  IkiHsimj  I  lie 
I  Inililei  hi  .nix  ui  lie  i  pe  i  soil  i»i  v  "i  pm  .ilioii  oi 
i  iimex  iM>j  .nix  1 1  p  1 1 1  **  *  *  i  pel  mission  n*  m.miil.K 
due.  use.  <n  sell  .mx  p.iienieil  nixeniitm  ili.il 
nu ix  m.tnx  u.ix  he  i  eluted  iheieli* 


()•  *  m  it  i  el  m  n  I  Ills  v«ipx  Ki  l.mi  <*i  ili  si  j  <  >x 


R  E  V I E IV  A  N  D  A  PPR(>\  A  L 


Tli is  technical  report  lias  been  rc\  icwcil  anil  is  approx cil  lor  publication 


JKRRY  "SILVERMAN 
Project  Kngineer 


HAROLt)  ROTH,  Director 
Solid  State  Sciences  Division 


UNCLASSIFIED 


SECURITY  C.  OSSIFICATION  OF  This  PAGE  IHTien  D»lm  Frtt* red) 


REPORT  DOCUMENTATION  PAGE 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


4  Title  < and  Subtitle) 

TRANSFORM  DECODING  OF  REED-SOLOMON  CODES 

VOLUME  I :  ALGORITHM  AND  SIGNAL  PROCESSING 

5  TYPE  OF  REPORT  ft  PERlOO  COVERED 

STRUCTURE 

6  PERFORMING  O^G  REPORT  NUMBER 

MTR-8278 *  Vol.  I 

7  AuThORC*) 

D.  0.  CARI10UN,  B.  L.  JOHNSON,  S.  J.  MEEHAN 

8  CONTRACT  OR  GRANT  NUMBER'*/ 

F19628-8 1-00001 

9  performing  organization  name  and  address 

The  MITRE  Corporation 

Burlington  Road 

Bedford,  NA  0 1 7^0 

10  PROGRAM  ELEMENT  PROJECT  TASK 
AREA  S  WORK  UNIT  NUMBERS 

Project  No.  7010/7170 

11  CONTROLLING  OFFICE  NAME  AND  ADDRESS 

Solid  State  Sciences  Division 

12  REPORT  DATE 

November  1982 

Rome  Air  Development  Center 

Hanscom  AFB,  MA  01731 

13  number  of  pages 

66 

'4  MONITORING  AGENCY  NAME  ft  AOORESS(7/  i/i/Zereftf  from  Controlling  Office) 

15  SECURITY  CLASS,  (of  this  report 

UNCLASSIFIED 

ISa  DECLASSIFICATION  DOWNGRADING 
SCHEDULE 

16  DISTRIBUTION  STATEMENT  ( of  this  Report) 

Approved  for  public  release;  distribution  unlimited. 

17.  DISTRIBUTION  STATEMENT  (of  the  abstract  entered  in  Block  20,  if  different  from  Report) 

18  SUPPLEMENTARY  NOTES 

19  KEY  WORDS  (Con<<nu©  on  reverse  side  it  necessary  and  identity  by  block  number) 

ERROR  LOCATION 

REED-SOLOMON  ENCODING  AND  DECODING 
SIGNAL  PROCESSING 
TRANSFORM  DECODING 


20  ABSTRACT  (Continue  on  reverse  aide  If  necessary  and  identity  by  block  number) 

This  report  discusses  in  detail  a  transform  decoding  algorithm  and  its 
hardware  implications,  for  the  Reed-Solomon  codes,  that  offer  major  simplifi¬ 
cations  relative  to  the  conventional  BCH  decoding  algorithm.  A  fast  algorithm 
for  encoding  and  syndrome  computation  is  described.  Modification  of  the  error 
location  process  to  accommodate  erasures  is  also  described.  Also  discussed  are 
hardware  implementation  issues  with  a  summary  of  design  features  and  parameters 
to  be  incorporated  in  a  future  set  of  programmable  integrated  circuits  for 
decoding  a  large  number  of  Reed-Solomon  codes. 


DD  ,3^3  1473  EDITION  OF  1  NOV  65  IS  OBSOLETE  UNCLASSIFIED 

SECURITY  CLASSIFICATION  OF  THIS  PAGE  (When  D»<»  ChIvNI 


SECURITY  CLASSIFICATION  OF  THIS  PAGEfWhan  Da/a  Bnltnd) 


UNCLASSIFIED 


SECURITY  CLASSIFICATION  OF  Tu.r  PaGEflFhan  Data  Enlarad) 


PREFACE 


The  second  printing  of  Volume  I  of  this  report  coincides  with 
the  publication  of  ESD-TR-82-403 ,  Vol.  II,  Transform  Decoding  of 
Reed-Solomon  Codes  Volume  II:  Logical  Design  and  Implementation. 
The  revised  printing  of  the  first  volume  contains  no  new  material 
other  than  corrections  of  typographical  errors. 


ACKNOWLEDGMENTS 


This  document  has  been  prepared  under  Project  7010/7170, 
Contract  F19628-81-C-0001 .  The  contract  is  sponsored  by  the  S 'lid 
State  Sciences  Division,  Rome  Air  Development  Center,  Hanscom  Air 
Force  Base,  Massachusetts. 


j 

j 


i 


.  i  i  ,  Si'.l/O I' 

P'i  :.•*  pi'C  Ini 


1 


TABLE  OF  CONTENTS 


Section  Page 

LIST  OF  ILLUSTRATIONS  5 

LIST  OF  TABLES  6 

I  INTRODUCTION  7 

1.1  Purpose  7 

1 . 2  Background  8 

1.3  Scope  10 

II  'l  RANSFORM  APPROACH  12 

2.1  Linear  Block  Codes  12 

2.2  Maximum  Distance  Codes  12 

2.3  Reed-Solomon  Codes  13 

III  REED-SOLOMON  TRANSFORM  ENCODING  20 

3.1  Reed-Solomon  Encoding  20 

3.2  Codeword  Generation  by  Discrete  Transformation  20 

3.3  A  Fast  Transform  Algorithm  24 

3.4  Field-Programmability  of  the  Fast  28 

Transform  Algorithm 

3.4.1  Multi-Dimensional  Transform  29 

Partitioning 

IV  REED-SOLOMON  TRANSFORM  DECODING  ALGORITHM  37 

4.1  Syndrome  Computation  37 

4.2  Error  Locator  39 

4.2.1  The  Berlekamp-Massey  Algorithm  40 

4.2.2  Modification  for  Correction  of  44 

Errors  and  Erasures 

4.2.3  Continued  Fraction  Algorithm  47 

4.3  Message  Extraction  51 


3 


TABLE  OF  CONTENTS  (CONCLUDED) 


Section 

V  HARDWARE  DESIGN  (SUMMARY) 


5.1  Functional  Partitioning  and  Programmability 

5.2  Throughput 

REFERENCES 


A 


LIST  OF  ILLUSTRATIONS 


Figure  Page 

1  Conventional  BCH  Decoding  Algorithm  16 

2  Transform  Decoding  Algorithm  18 

3  Structure  for  Computing  the  Transform  by  23 

Polynomial  Evaluation 

4  Structure  for  Computing  a  31-Point  Transform  26 

over  GF(2^) 

5  Structure  for  Computing  a  255-Point  Transform  over  35 

GF(2°)  using  Two-Dimensional  Partitioning 

6  Canonic  Ladder  For  Stieltjes  Continued  49 

Fraction  Realization  (v=3) 

7  Transform  Section  53 

8  Error  Locator  54 


5 


LIST  OF  TABLES 


Table 
I 
II 
I ;  I-i 
II 1-2 

II 1-3 

IV 

V 

VI 


Comparison  of  Computational  Complexity 

Transforms  over  GF(2m) 

Minimal  Irreducible  Polynomials  over  GF(2  ) 

Minimal  Irreducible  Polynomials  over  GF(27) 
and  GF(2°) 

Minimal  Irreducible  Polynomials  over  G1 (2  ) 
and  CF(2A) 

Reed-Solomon  Codes  Accommodated  by  the  Hardware 
Design 

Half  Hate  Reed-Solomon  Codes  Accommodated  by 
the  Hardware  Design 

Decoding  Times  for  Half-Rate  Reed-Solomon  Codes 


Page 

27 

29 

31 

32 

33 

55 

56 

59 


6 


SECTION  I 


INTRODUCTION 


1. 1  Purpose 

One  of  the  continuing  concerns  of  the  Low  Cost  Electronics  pro¬ 
ject  is  the  application  of  new  technology  to  the  implementation  of 
error-correcting  codes  for  reliable  data  communication.  Our  interest 
stems  from  the  need  for  low-cost  hardware  to  implement  error-correction 
codes  that  exhibit  significant  coding  gain  on  interference-resistant 
communication  channels.  Previous  studies  of  coding  gain  led  us  to 
concentrate  work  on  the  implementation  of  the  Reed-Solomon  class  of 
generalized  BCH  codes.  This  class  of  codes,  although  well-suited  to 
the  correction  of  both  isolated  random  errors  and  random  error  bursts 
because  of  its  optimum  distance  properties,  continues  to  be  genuinely 
in  need  of  efficient  decoding  algorithms  implemented  by  low-cost  hard¬ 
ware  . 

This  report  examines  a  transform  decoding  algorithm  and  its  hard¬ 
ware  implications  for  the  Reed-Solomon  codes.  This  algorithm  offers 
major  simplifications  relative  to  the  more  conventional  BCH  decoding 
algorithm,  resulting  from  reduced  algorithm  complexity  and  from  the 
opportunity  to  apply  fast  computational  techniques  for  implementing 
its  major  portions. 

While  processing  functions  required  for  algebraic  coding  have 
been  associated  conventionally  with  digital  data  processing,  the 
basic  functions  of  the  transform  decoding  method  are  convolution  and 
discrete  transformation,  typical  signal  processing  functions.  An 
essential  difference  between  the  error  coding  requirements  and  the 
conventional  digital  signal  processing  practice  is  that  the  former 
require  the  operations  to  be  carried  out  in  a  finite  algebraic 
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field  or  ring.  Although  this  territory  may  seem  unfamiliar  to  the 
digital  signal  processing  community,  in  actuality  many  familiar 
concepts  apply  directly,  and  in  some  instances  implementation  may  be 
simpler  (e.g.,  elimination  of  carry  bits,  multiplication  by  adding 
’logarithms"!  if  the  unique  properties  of  the  finite  structure  are 
exploited.  Although  it  will  become  evident  that  the  error  coding 
functions  impose  a  relatively  high  degree  of  complexity  on  the  pro¬ 
cessing  circuitry,  it  is  hoped  that  the  analogies  with  more  conven¬ 
tional  linear  digital  signal  processing  functions  will  aid  in  the 
development  of  functional  LSI  hardware  -  and  perhaps  also  lead  to 
the  use  of  finite-field  methods  where  useful  and  appropriate  for 
other  linear  signal  processing  purposes. 

1 . 2  Background 

Previous  work  on  the  low-cost  electronics  project  demonstrated 
that  error  correction  coding,  when  used  in  conjunction  with  spread- 
spectrum  modulation,  provides  an  added  dimension  to  the  design  of 
jam-resistant  communications  systems.  Error  correction  codes 
are  useful  to  correct  message  errors  caused  by  interference,  additive 
random  noise,  and  other  disturbance  present  on  the  channel.  They  do 
so  by  adding  redundant  symbols  according  to  a  predetermined  strategy 
(coding)  and  then  extracting  the  correct  message  from  the  noisy 
received  signal  (decoding),  aided  by  prior  knowledge  of  the  code, 
(liven  sufficent  channel  signal-to-noise  ratio  (measured  at  the 
receiver)  it  can  be  shown  that  this  represents  an  effective  means 
of  obtaining  low  error  probability  in  the  decoded  message,  even 
though  the  energy  per  transmitted  symbol  is  reduced  because  of  the 
added  redundai.  ’y  (we  assume  a  fixed  energy  available  per  source 
message  symbol).  In  other  words,  the  coding  strategy  produces  an 
effective  gain  (at  a  modest  price  in  bandwidth)  that  more  than  com¬ 
pensates  for  the  necessity  to  spread  the  available  energy  over  the 
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combined  message  and  redundancy  symbols.  The  net  result  is  a  coding 
gain  obtained  by  computational  processing  that  may  range  from  a  few 
dB  for  channels  corrupted  by  additive  gaussian  noise  to  some  tens  of 
dB  for  randomly  fading  noisy  channels.  As  the  tolerance  to  errors 
in  the  output  data  is  lowered,  the  arguments  for  using  error  coding 
are  strengthened. 

Under  this  project  we  previously  examined  the  application  of  the 
BCH  decoding  algorithm  for  decoding  Reed-Solomon  codes  with  particu¬ 
lar  attention  to  it:,  structural  implementation.  We  also  exper¬ 
imented  with  the  direct  decoding  of  Reed-Solomon  codes  having  short 
block  lengths  by  implementing  a  code-table  search  algorithm  under 
microprocessor  control.  The  work  reported  here  is  concerned 
with  the  algorithm  and  structures  for  decoding  Reed-Solomon  codes  by 
application  of  a  transform  method. 

If  the  message  symbols  to  be  encoded  by  a  Reed-Solomon  code  lie 
in  the  domain  of  a  linear  Fourier-like  transf ormat ion  from  the  mes¬ 
sage  to  the  sequence  that  is  transmitted  over  the  communication 
channel,  then  it  is  possible  to  eliminate  a  number  of  computational 
steps  in  the  decoding  algorithm.  The  finite-field  linear  transforms 
involved  are  directly  analogous  to  the  discrete  Fourier  transform 
pair  defined  over  the  complex  number  field,  except  that  in  this  case 
they  are  defined  in  the  finite  algebraic  field  that  contains  the 
code  synhols.  The  transform  algorithm  can  be  summarized  by  the 
following  steps: 

Encoding 

1.  Define  a  sequence  in  the  algebraic  domain  of  the 

channel  code  transform  as  the  message  symbols  accom¬ 
panied  by  a  sequence  of  zeros. 

The  transform  method  of  decoding  Reed-Solomon  codes  has  a  historical 
precedent  in  the  definition  of  the  codes,  but  only  recently  has 
there  been  renewed  interest  in  its  application  [1],  [2],  [3j. 
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2.  iV'nerato  the  channel  codeword  by  taking  the  forward 
transform  of  the  defined  sequence. 

(This  produces  a  ron-systemat ic  channel  code.) 

Decod  ing 

1.  Take  the  inverse  transform  of  the  received  se  1  ■  nee. 

d.  Isolate  the  error  syndrome  from  the  inverse  transform, 
and  use  it  to  determine  the  error  locator  polynomial 
or  equivalently  the  connection  polynomial  of  a  cor¬ 
responding  linear  feedback  shift  register  (I.FSR). 

1.  F.xtrapolate  the  inverse  error-transform  sequence  using 
the  unforced  response  of  the  synthesized  LFSR,  and 
subtract  it  f >•  ?m  the  inverse  transform  of  the  received 
sequence.  The  difference  is  the  corrected  message 
sequence . 

In  comparison  vith  the  convontio'al  BCH  decoding  algorithm  as  applied 
to  Reed-Solomon  codes,  the  transform  decoding  algorithm  eliminates 
the  need  to  search  for  roots  c r  the  error  locator  polynomial.  It  is 
also  unnecessary  to  calculate  the  error  values  in  the  co-domain  of 
the  channel  code,  a  step  that  in  effect  solves  a  set  of  simultaneous 
linear  equations.  For  a  non-systematic  channel  code,  the  BCH  decoding 
algorithm  requires  polynomial  division  by  the  code  generator  poly¬ 
nomial  of  the  corrected  received  sequence  in  order  to  retrieve  the 
information  symbols.  This  step  is  eliminated  in  the  transform 
algorithm  because  the  message  symbols  are  defined  in  the  domain  that 
contains  the  inverse  error  transform  values. 

1 . 3  Scope 

The  next  section  of  this  report  describes  in  overview  the  Reed- 
Solomon  codes  and  the  transform  decoding  algorithm  in  a  convenient 
algebraic  setting  in  which  the  discrete  transform  and  its  inverse 
are  regarded  as  equivalent  to  polynomial  evaluation  and  inter¬ 
polation.  In  Section  III  a  fast  algorithm  for  ii..p  1  erne.  *  i ng  the 


in 


r. 


transform  and  its  application  for  the  encoder  are  discussed. 

Section  IV  presents  the  major  steps  of  the  decoding  algorithm,  in¬ 
cluding  modification  of  the  basic  algorithm  to  accommodate  the  cor¬ 
rection  of  both  errors  and  erasures  up  to  the  minimum  distance  limit 
of  the  code. 

In  Section  V  we  discuss  some  hardware  implementation  issues  and 
present  a  summary  of  recommended  design  features  and  parameters  to 
be  incorporated  in  a  set  of  integrated  circuits  to  enable  decoding 
of  a  large  number  of  Reed-Solomon  codes.  The  second  volume  of  this 
report  [4]  documents  the  detailed  logical  design  of  circuitry  and 
the  construction  of  a  discrete  logic  breadboard. 
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THK  TRANSFORM  APPROACH 


2 .  1  Linear  Block  Codes 

A  linear  block  code  is  a  linear  mapping  (homomorphism)  from  a 
k-dimensional  vector  space  to  a  larger  n-dimensional  vector  space; 
the  redundancy  is  n-k.  The  message  space  may  consist,  for  example, 

TT1  k 

of  (')')  sequences  —  or  k-tuples  —  of  m-bit  message  symbols  while 

the  code  space  consists  of  (2m)n  code  sequences  -  or  n-tuples  -  of 

m  k 

m-bit  symbol  -  ci-nrauiing  (2')  code  sequences  that  combine  both 
message  and  redundancy.  The  mapping  increases  the  distance  (or 
spacing)  between  code  words  corresponding  to  message  sequences,  dis¬ 
tance  being  measured  as  the  number  of  different  symbols  between  pairs 
of  such  code  words.  In  fact  this  distance  increases  exponentially  as 
the  redundancy  increases  linearly.  When  random  errors  occur  in  the 
transmitted  code  words,  the  increased  spacing  permits  decoding  by  a 
maximum  distance  (equivalent  to  maximum-likelihood)  decoding  rule. 

When  the  block  length  of  the  code  becomes  sufficiently  long,  such  a 
decoding  procedure  becomes  impractical.  One  may  then  resort  to  the 
algebraic  properties  of  the  code  construction  to  facilitate  practical 
decoding. 

2 . 2  Maximum  Distance  Codes 

The  Reed-Solomon  codes  in  which  we  are  interested  are  members  of 
a  class  of  linear  block  codes  called  maximum  distance  separable  codes. 
These  are  codes  for  which  the  minimum  distance  between  pairs  of  code 
words  equals  the  maximum  value 

d  =  n  -  k  ■!-  1  (1) 

admitted  by  the  Hamming  volume  (or  sphere-packing)  bound.  For  such 
codes  it  is  possible  to  correct  a  number  of  symbol  errors  not  exceed¬ 
ing  one-half  the  redundancy  of  the  code.  Arguments  based  on  the 


triangle  inequality  can  be  used  to  show  that  twice  as  many  erasures 
are  correctable,  an  erasure  (rather  than  an  error)  being  defined  as 
a  codeword  svi'bol  of  unknown  (rather  than  incorrect)  value  occurring 
at  a  known  (rather  than  unknown)  symbol  location.  In  fact,  it  can 
be  shown  that  it  is  possible  to  correct  any  combination  of  t  errors 
and  s  erasures  provided  that  the  inequality 

2t  +  s  £  n  -  k  (2) 

is  satisfied  for  the  maximum  distance  codes. 

In  order  to  discuss  the  structural  properties  of  such  codes 
and  the  apparatus  for  coding  and  decoding,  it  is  convenient  to  de¬ 
scribe  an  n-tuple  of  m-bit  symbols  by  a  polynomial  of  degree  n- 1 
having  coefficients  that  are  members  of  the  finite  algebraic  field 
of  2m  elements.  Such  a  polynomial  is  determined  uniquely  by  its  n 
coefficients,  or  equivalently  by  its  values  at  any  n  distinct  points 
of  the  field.  A  code  word  of  block  length  n,  for  example,  may  be 
specified  either  by  a  set  of  n  values  or  by  the  polynomial  coeffi¬ 
cients  interpolated  from  those  values. 

2. J  Reed-Solomon  Codes 

A  maximal  distance  code  of  block  length  n  and  dimension  k  can 
be  generated  by  a  set  of  k  functions  (g^(z)}  defined  over  an  alge¬ 
braic  field  F  containing  at  least  n  elements  provided  that  the  func¬ 
tions  in  the  ret  are  independent  and  that  no  linear  combination  of 

them  has  more  than  k-1  roots  in  the  field.  The  Reed-Solomon  codes 

23  k  1 

are  a  special  case  for  which  the  function  set  is  {1,  z,  z  ,  z  zK  i}[5]. 

We  are  concerned  here  with  codes  (and  messages)  for  which  the 
symbols  are  binary  m-tuples.  These  are  elements  of  the  Galois  field 
of  2m  elements.  The  elements  of  this  finite  field  can  be  written  as 

2  i 

0,  b,  b  ,  ....  b-  -i,  where  b  is  a  primitive  element  of  the  field. 

For  the  Reed-Solomon  codes,  let  the  message  be  represented  by  the 

2  k-1 

polynomial  f  (x)  =  aQ  +  a^z  4-  a2z  +  ...  +  a^^z  where  the  coeffi¬ 
cients  a^  are  elements  of  the  field  F  =  GF(2m)  that  represent  the  set 
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of  k  message  symbols.  The  mapping  from  the  message  space  to  the 

code  space  is  accomplished  by  evaluating  the  message  polynomial  fix) 

at  tlie  2m-l  non-zero  points  of  the  field.  The  transmitted  code  word 

consists  of  the  sequence  of  n  =  2'"-l  values 

a  9m_i 

f(b),  foo,...,  fur  l) 

resulting  from  the  polynomial  evaluation.  To  decode  the  message 
after  receiving  the  code  sequence,  one  may  form  the  n  equations 

f(b)  =  a,,  +  a.b  +  a,b^  +  .  .  .  +  a  .  b^  * 

012  k-1 

a  •>  4  2k-2 

f(b“)  =  a0  +  aLb  +  a2b  +  .  .  .  +  a^b 


(3) 


f(b2  _1>  =  a0  +  +  a,  +  .  .  .  +  afe_j 

where  the  coefficients  a.  are  the  unknown  values  to  be  found  bv 

l 

solution  of  the  equation  -et.  Any  subset  of  k  of  there  equations 
is  linearly  independent  and  may  therefore  be  used.  If  one  or  more 
of  the  code  symbols  are  received  in  error,  then  different  solutions 
may  result  from  the  various  k-subsets.  The  errors  may  be  corrected 
bv  solving  all  of  the  distinct  k-subsets  and  taking  a  majority  vote. 
Such  an  approach,  although  valid  in  principle,  is  generally  not 
practical  to  implement  directly  because  of  the  large  number  of  equation 
sets  to  be  solved,  each  implying  the  inversion  of  a  square  matrix  of 
dimension  k. 

Combinatorial  methods  based  on  (n, k, t )-cover ing  systems  have 
been  devised  as  a  way  of  trapping  the  correctable  error  patterns  in 
order  to  reduce  systematically  the  number  of  equation  sets  to  be 

solved.  When  combined  with  a  minimum  distance  decoding  rule,  the 
covering  method  can  yield  practical  results  for  codes  of  modest 
block  length.  It  can  also  accommodate  shortened  cyclic  codes. 
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For  codes  of  moderately  long  block  length,  one  can  successfully 
apply  a  concise  decoding  algorithm  based  on  the  algebraic  properties 
and  structure  of  the  code.  The  well-known  BCH  decoding  algorithm 
depends  on  calculating  an  error  syndrome  as  a  linear  transformation 
of  the  received  channel  symbols  and  then  using  the  syndrome  to 
determine  an  error  locator  polynomial  as  the  solution  of  a  linear 
recursion  (the  so-called  key  equation)  [6].  The  roots  of  the  error- 
locator,  which  may  be  found  by  a  systematic  root-search,  designate 
the  locations  of  the  errors  (or  the  multiplicative  inverses  of  the 
locations).  These  values  are  then  used  to  solve  a  set  of  linear 
equations  for  the  channel  errors,  enabling  correction  of  the  channel 
code  and  subsequent  extraction  of  the  message  [7]. 

The  error  locator  is  the  key  step  of  the  algorithm.  It  may  be 
regarded  as  a  means  of  determining  which  set  of  k  equations  drawn 
from  the  full  set  of  equation  (3)  is  sufficient  to  solve  for  the  message 
symbols,  since  having  determined  which  values  f(bX)  are  in  error, 
the  remaining  ones  must  be  hypothesized  to  be  correct.  The  conven¬ 
tional  BCH  decoding  algorithm  is  described  schematically  in  Figure  1. 

In  our  work  we  resort  to  a  transform  decoding  algorithm  for  the 
Reed-Solomon  codes  that  is  a  variation  of  the  BCH  decoding  algorithm 
which  allows  a  major  reduction  in  the  computational  complexity  of  the 
algorithm.  The  principal  features  of  our  implementation  include: 

1.  Interpolation  of  the  code  word  sequence  to  produce 
a  coefficient  syndrome. 

2.  Solution  of  a  linear  recursion  based  on  the  interpolated 
coef  f icients . 

3.  Extrapolation  of  a  sequence  of  error  coefficients 
to  extract  the  message  coefficients. 

It  will  be  shown  that  the  first  step  is  equivalent  to  taking  an  n- 
point  transform  (defined  over  the  finite  field  C,F(2m))  that  is 
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14-61.248 


Figure  1.  Conventional  BCD  Decoding  Algorithm 


completely  analogous  to  the  discrete  Fourier  transform  defined  over 
the  field  of  complex  numbers.  The  second  step,  which  determines 
the  symbol  error  locations,  uses  an  algorithm  (Berlekamp-Massey) 
that  synthesizes  the  shortest  linear  feedback  shift  register  that 
generates  a  prescribed  sequence  [8j.  Each  iteration  of  this  algor¬ 
ithm  involves  a  convolution  of  the  syndrome  coefficients  with  the 
shift  register  tap  weights.  The  third  step  utilizes  the  unforced 
response  of  the  synthesized  LFSR,  again  involving  convolution.  A 
simple  block  diagram  of  the  decoding  apparatus  is  shown  in  Figure  2. 

The  transform  approach  to  Reed-Solomon  codes  is  evident  in  their 
original  exposition  [11,  the  connection  between  the  message  symbols 
and  the  channel  code  being  variously  regarded  as  based  on  the  Mattson 
Solomon  polynomial  [9],  the  Chinese  remainder  theorem  [10], 
LaGrange's  interpolation  formula  [11],  and  more  recently  number- 
theoretic  Fourier-like  transforms  [2,4,12,13].  The  computational 
advantages  of  a  transform  approach  to  encoding  and  decoding  relative 
to  the  more  or  less  standard  BCH  decoding  algorithm  have  been  pre¬ 
viously  discussed  [2,4,12,13]  but  efficient  hardware  implementing 
these  codes  is  still  not  commonly  available.  The  principal 
computational  advantages  are  (1),  the  ability  to  both  encode  and 
compute  the  error  syndrome  with  fast  algorithms  that  mimic  FFT  algo¬ 
rithms,  and  (2),  the  ability  to  predict  the  transform  of  the 
channel  error  pattern  from  the  error- locator  polynomial.  The  second 
advantage  avoids  a  root-search  for  the  error-locations,  followed  by 
generation  of  the  error  evaluation  polynomial  and  explicit  compu¬ 
tation  of  the  channel  error  values.  Instead,  to  decode  one  needs 
only  to:  a)  compute  the  transform  of  the  received  channel  sequence, 
the  error  syndrome  constituting  a  subset  of  the  transform,  b)  compute 


Transform  Decoding 


,  ,  .  iv-prative  algorithm  (Berlekamp- 

““  error  locator  „hlch  aerates  or,  the  syn- 

Massey  algorithm  or  continu  error  sequence  as  a 

drome ,  and  O  generate  the  transform  ot 

Unear  recursion  with  the  error  locator  po  y  omial. 
the  third  step  was  proved  by  Reed,  et  al.  1- 


19 


SECTION  III 


RE ED- SOLOMON  TRANSFORM  ENCODING 


3 . 1  Reed-Solomon  Encoding 

Reed-Solomon  codes  were  first  described  in  1960  1 2 1 .  Shortly 
afterwards  they  were  generalized  and  a  decoding  algorithm  based  on 
their  algebraic  properties  war.  suggested  [6].  Today  these  codes 
are  usually  described  as  the  polynomial  product  of  a  generator 
function  g(z)  defined  over  a  finite  field  GF(q)  with  an  information 
sequence  polynomial  defined  over  the  same  field  [7].  In  algebraic 
terminology,  the  code  forms  an  ideal  in  the  ring  of  polynomials 
modulo  z11  -  1  over  GF (q)  that  is  generated  by  g(z).  The  roots  of  g(z), 
which  are  contained  in  GF(q),  consist  of  a  consecutive  set  cf  elements 
of  the  cyclic  multiplicative  group  of  GF^q).  Since  the  code  ideal 
consists  of  all  products  of  g(z),  each  member  when  evaluated  at  any 
of  the  roots  of  g(z)  becomes  congruent  to  zero.  This  is  the  basis 
for  determining  the  error  syndrome  of  the  channel  sequence  which  is 
required  for  further  decoding.  Below  we  discuss  the  generation  of 
codewords  from  the  transform  viewpoint. 

3. 2  Codeword  Generation  by  Discrete  Transformation 

We  have  described  the  function  of  code  generation  in  Section  II 
as  n-point  evaluation  of  a  message  polynomial  defined  over  a  finite 
algebraic  field.  The  inverse  function  is  interpolation  of  the  poly¬ 
nomial  from  Its  n  values.  These  functions  (evaluation  and  interpol¬ 
ation)  are  inverse  in  the  sense  that  their  (commutative)  product  is 
the  identity  function.  it  has  also  been  shown  that  these  functions 
form  a  "transform"  pair  analogous  to  the  discrete  Fourier  transform 
pair  defined  over  the  complex  number  field  [14],  In  our  case  the 
transforms  are  defined  over  the  finite  algebraic  field  of  the  code. 

Let  a.,  a.  a  ,  be  distinct  elements  of  a  finite 

0  1  n-1 
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algebraic  field  GF (pm)  of  order  pm  -  1,  having  an  element  b  of  order 
n.  The  linear  transformation 

n-1 


A,  -^b13  (O 

i=0 


is  an  endomorphic  mapping  of  GF (pm) .  It  is  assumed  that  n  divides 
pm  -  1,  the  order  of  the  field,  and  for  our  purposes  n  =  pm  -  1. 

In  that  case  the  field  element  b  is  a  primitive  n1^  root  of  unity. 
It  can  be  shown  that  for  any  integer  r. 


n-1 


i=0 


n,  r  =  0  mod  n 


0,  otherwise 


(5) 


and  the  property  can  be  used  to  verify  by  direct  calculation  that 
the  mapping  that  is  inverse  to  that  of  equation  (A)  is  the  linear 
transformation 


n-1 

ai=n  1  ^  ^  Ajb  ^  (6) 

3=0 

where  -n  n  =  pm  -  1.  Equations  (A)  and  (6)  define  a  discrete  trans¬ 
form  pair  over  GF(pm)  and  the  operations  of  addition  and  multipli¬ 
cation  are  defined  in  the  same  field.  Addition  may  be  performed  as 
modulo-p  addition  of  the  m-tuples  that  are  the  field  elements  com¬ 
posing  the  sum.  Multiplication  may  be  defined  by  addition  of  in¬ 
dices  of  the  field  elements 


b  b  =  b 


,  (r  +  s)  mod  p 


(7) 


The  transform  pair  of  equation  (A)  and  equation  (6)  is  analogous 
to  the  discrete  Fourier  transform  pair  for  which  b  would  be  a  complex 
n*”^  root  of  unity  and  the  arithmetic  would  be  defined  in  the  complex 
number  field.  The  conceptual  value  of  the  Fourier  transform  pair 
is  preserved  in  the  finite  field.  In  particular,  the  cyclic 
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convolution  property  holds.  Fast  computational  algorithms,  analogous 
to  the  FFT  algorithms,  can  also  be  applied. 

If  the  sequence  to  be  transformed  is  expressed  as  a  polynomial 
over  t.l  (p  ) 

a  ( 2 )  =  a  +  a  z  a  z  "+...+  a  z  n  1  ,8) 

old  n-1 

then  the  transform  of  the  sequence  a  ,  a.,  a, . a  .is 

o’  1’  2’  n-1 

identical  with  polynomial  evaluation  of  a(z)  at  the  n  distinct 
O  1  2  n-1 

points  b  ,  b  ,  b  ,  .  .  .,  b  and  the  inverse  transform  is  identical 
with  interpolation  of  the  polynomial  a(z)  from  its  n  values. 

In  order  to  generate  a  Reed-Solomon  (n,k)  code ,  we  let  the  set 
,  a^,  .  .  .,  ak_j  represent  the  message  symbols,  setting  a.  =  0  for 
i  =  k,  k  +  1,  .  .  . ,  n  -  1,  and  evaluate  the  resulting  polynomial 
(by  calculating  the  transform  of  the  message  sequence)  at  the  n  =  p‘  -  1 
non-zero  units  of  the  field.  The  transform,  or  polynomial  evaluation, 
can  be  expressed  as  a  continued  product 

a(bh  =  a  +  (a  +  .  .  .  +  (a  +  b^a  )  .  .  .)  (9) 

o  1  n-2  n-1 

or  equivalently  it  can  be  interpreted  as  the  remainder  of  the  poly¬ 
nomial  division  a(z)  /  (z-b-1)  evaluated  at  b-1 .  The  second  inter¬ 
pretation  may  be  represented  as  the  set  of  polynomial  congruences 

aClr1)  5  a(z)  mod  (z-bh ,  j  =  0,  1,  .  .  n  -  1.  (10) 

A  structure  for  computing  the  transform  is  shown  in  Figure  3.  Notice 

that  the  same  structure  can  be  used  for  calculating  both  the  trans- 

2 

form  and  its  inverse.  The  method  shown  requires  n  separate  products 
in  GF(pm)  to  be  formed  by  sequencing  n  symbols  through  a  set  of  reg¬ 
isters  containing  n  separate  multipliers.  Since  the  hardware 
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complexity  of  finite  field  multiplication  is  formidable,  we  should 
look  for  "fast"  computational  algorithms  that  reduce  the  number  of 
multiplications  in  GF(pU).  Alternatively,  one  could  use  table-look¬ 
up  multiplication  at  the  sacrifice  of  speed, 
i ■  '  A  Fast  Transform  Algorithm 

A  fast  transform  algorithm  --  one  that  tends  to  minimize  the 

m 

number  ot  multiplications  in  (',F(p  )  —  can  be  devised  to  compute  the 
transform  pair  of  equation  (4)  and  equation  (6).  The  set  of  congru¬ 
ences  of  equation  (10)  can  be  calculated  in  principle  by  dividing 
the  polynomial  a  (z)  separately  bv  the  first  degree  polynomials  (z-b’.i, 
ken  piny  only  the  remainders.  That  is  operationally  equivalent  to 

evaluating  u(z)  at  the  n  non-zero  field  points  b’ .  In  either  case 

2  m 

n  -ul  i  Lpl  icat  ions  in  C.F(p  )  are  implied. 

An  equiv.’ent  method  is  to  first  divi’e  a(z)  by  a  smaller  set  of 
polynomials  of  higher  degree  containing  distinct  factors  of  the 
form  (z-b’),  and  then  to  evaluate  the  remainder  polynomials  at  the 
appropriate  values  bf.  If  this  set  of  divider  polynomials  is  the  set 
of  minimal  polynomials  of  the  non-zero  field  elements,  then  their 
coefficients  are  elements  of  the  prime  field  GF(p)  so  that  only  scalar 
m  ltiplication  by  the  elements  of  the  prime  field  is  required  in  the 
first  step.  This  is  particularly  significant  when  p  =  2  and  the  cor¬ 
responding  scale  factors  are  either  zero  or  one.  The  equivalence  of 
the  two  methods  is  easily  seen  by  examination  of  the  factorization  over 
C,F(pm)  of  the  polynomial  zn  -  1: 


yn  -  1  -  7""[  (z-b-’);  be  GF(pm)  (11) 

j=0 

which  can  also  be  expressed  as  the  product  o>‘  the  minimal  polynomials 
nm(z)  having  as  roots  the  value  b1  and  its  conjugates  (b1)’5  .  This 
factorization  is  given  explicitly  by 
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M 

TT 

i=  1 

whore  M  is  the  number  of  irreducible 

m.(z)  =  (z-h1)  (z-blp)  .  .  .  (z-blpq).  (12b) 

The  minimal  polynomials  m^(z)  have  their  coefficients  restricted  to 
the  prime  field  GF(p). 

To  complete  the  transform  computation,  we  must  evaluate  each  of 
the  remainders  at  the  conjugate  roots  of  the  associated  minimal  poly¬ 
nomial  divisor.  This  second  step  requires  multiplications  in  GF(pm) , 
but  the  number  is  substantially  reduced  because  there  is  a  relatively 
small  number  of  remainder  polynomials,  each  of  smaller  degree  than 
the  degree  of  field  extension. 

As  an  illustration  of  the  fast  algorithm,  a  structure  for  compu¬ 
tation  of  a  31-point  transform  over  GF(2^)  is  shown  in  Figure  4. 

Notice  that  division  by  the  5th  degree  minimal  polynomial  factors 
of  z31-l  is  accomplished  in  a  set  of  six  binary  feedback  shift  reg¬ 
isters  of  length  m  =  5.  The  additional  transform  point  corresponding 
to  division  by  the  first  degree  polynomial  factor  z-1  is  calculated 
simply  by  summing  the  coefficients  of  a(z).  In  the  structure  shown 
we  use  a  single  multiplier  in  GF(2^)  sequentially  to  evaluate  the 
remainder  polynomials.  This  choice  of  implementation  is  used  in  the 
hardware  design  of  our  encoder  and  decoder  because  of  the  relative 
complexity  of  such  a  multiplier  and  also  because  it  supplies  data 
at  a  sufficient  rate  for  the  remaining  operations.  For  the  example 
given,  the  number  of  multiplications  in  GF(2^)  resulting  from  the 
remainder  evaluation  has  been  reduced  from  930  to  120  by  the  use  of 
this  algorithm.  The  corresponding  reductions  for  some  other  situations 
of  interest  are  listed  in  Table  I. 


m.  (z)  (12a) 

factors  of  zn  -  1  and 
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RESIDUE  CALCULATOR 


ting  a  '31 -Point 


3 . 4  Field  Programmability  of  the  Fast  Transform  Algo rithm 

The  hardware  for  the  fast  transform  algorithm  described  in 
section  3.3  can  be  reconfigured  easily  to  compute  transforms  of 
appropriate  lengths  in  the  different  Galois  fields  CF(2m).  In  each 
field  of  order  2m  a  transform  of  n  points  can  be  defined  for  every 
integer  n  that  divides  2m  -1.  Thus  in  GF(2^),  transforms  of 
255,  85,  51,  17,  5  and  3  points  and  the  trivial  1-point  (identity) 
transform  can  be  defined.  Division  by  the  minimal  polynomial  factors 
that  split  zn  -1  is  performed  in  a  set  of  binary  feedback  shift  reg¬ 
isters  of  length  m.  There  are  m  of  these  registers  identically  con¬ 
figured,  operating  on  the  m-bit  symbols  of  the  input  sequence,  and 
used  in  association  with  each  of  the  minimal  polynomial  factors.  As 

an  example,  for  a  31-point  transform  over  GF(2~*),  there  are  six  5th- 

31 

degree  factors  and  one  first-degree  factor  that  split  z  -1.  Division 
by  each  polynomial  factor  requires  5  binary  feedback  shift  registers 
wired  with  the  same  connection  polynomial,  as  shown  in  Figure  4. 

We  have  designed  our  Reed-Solomon  coder  and  decoder  to  accommo¬ 
date  codes,  of  both  maximum  and  sub-maximum  lengths,  having  symbol 
fields  ranging  from  4  to  8  bits  per  symbol.  The  lengths  of  the  non¬ 
trivial  transforms  defined  in  the  various  fields  are  tabulated  in 
Table  II  together  with  the  minimal  polynomial  factors  of  z°  -1  for  the 
different  transform  lengths  n  in  each  field.  The  minimal  polynomials 
are  given  in  Table  1 1 1  — 1  through  Table  1 1 1 — 3 .  If  we  provide  for  a  255- 

g 

point  transform  over  GF(2  );  by  reconfiguring  the  lengths  and  con¬ 
nection  polynomials  of  the  feedback  shift  registers,  all  of  the  other 
cases  can  be  accommodated.  Of  course,  the  multiplier  in  GF(2m)  that 
is  used  in  evaluating  the  remainders  needs  also  to  be  reconfigured. 

The  design  details  of  the  hardware  that  can  be  electronically  reconfig¬ 
ured  to  compute  the  full  set  of  transforms  enumerated  in  Table  II  are 
described  in  Volume  II  of  this  report.  We  have  constructed  a  bread¬ 
board,  with  medium  scale  logic,  of  a  transformer  used  in  a  Reed-Solomon 
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TABLE  II 


Transforms  over  GF(2  ) 


Field  of 
Calculation 


Transform  Length 
N 


Required  Minimal  Polynomial 
Divisors 


GF(28) 


GF(27) 


255 


85 


51 


17 


15 


127 


11 
"17 
"23 
"29 
"39 
"47 
"55 
"63 
"91 
"l  t  9 


(z), 
(z)  , 
(z), 
(z), 
(z)  , 
(z)  , 
(z), 
(z)  , 
(z), 
(z)  , 
(z), 
(z), 


m. 


m 


13 
1 1 9 
25 
"31 
"43 
"51 
"59 
n85 
"95 
n127 


(z), 
(z)  , 
(z), 
(z)  , 
(z)  , 
(z)  , 
(z)  , 
(z)  , 
(z)  , 
(z)  , 
(z)  , 
(z) 


m3  (z), 
(z), 

ml5 

m21  2  ’ 
m„7  (z), 

(z), 

m.%  (z), 

m”  (z), 

(2), 

(2). 

mul(z) , 


mn 

(z). 

m3 

(z) , 

mQ 

(z) , 

u 

m15 

m39 

m63 

(z)  , 
(z), 
(z). 

m21 

m45 

m87 

(z) , 
(z)  , 
(z). 

m27 

m51 

mlll 

(z), 
(z)  , 
(z) 

m 


0 


(z)  , 
(z)  , 


mr 


m25  ,  , 
nig5  (z), 


(z) , 
(z), 


m45  ,  v 
m95  (2) 


m  (z), 
m55  (z). 


0 


(z) , 


m15  (z), 


m45  (Z) 


mn  (z), 
rag5  (z), 


m17  (z), 
ml  19  <Z) 


m^^  (z). 


(z). 


mgl  (z) 


(z). 


m85  (Z) 


mr 


(z)  , 
(z) , 

nii  (z!’ 

m19 

m27  2  ’ 

m43  2  ’ 

m63  (z) 


m. 


(z), 
(z), 

m13  (ZJ’ 
m21  2  * 
m29  2  ’ 

m47 


m„ 


(z), 
(z)  , 
nii  c  (2), 
(z), 

m,,  (z), 
m^3  (z), 


*  Cases  included  in  breadboard 
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TABLE  II  (Concluded) 


Transforms  over  CF(2m) 


Field  of 
Calculation 

Transform  Length 
N 

Required  Minimal  Polynomial 
Divisors 

CF(26) 

63 

m  (z)  ,  m  (z) ,  m  (z) , 

m5  (z),  m?  (z),  n>9  (z), 

fig  .  (z) ,  m,  (z) ,  m, .  (z) , 

m21  (z),  m22  (z),  m27  (z), 

'"31  (z) 

* 

21 

9 

mQ  (z),  m?  (z),  m  (z) 

•k 

1 

mQ  (z),  trig  (z),  m27  (z) 

* 

3 

mQ  (z),  m21  (z) 

CF(25) 

31  * 

mQ  (z),  m1  (z),  m^  (z), 
m  (z) ,  m*  (z) ,  m  (z) , 
mj5  (z)  7  U 

mu 

15* 

m0  (z),  m1  (z),  m3  (z), 
m5  (z),  m?  (z) 

r  * 

5 

mQ  (z),  m3  (z) 

_ 

* 

3 

mQ  (z) ,  m  (z) 

Coses  included  in  breadboard 
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Table  III  -  2 


mi(z)  =  mQ 


Polynomial 


11 
"l3 
n15 
nl9 
n21 
n23 
n27 
n29 
n31 
"43 
n4  7 
"55 
n63 


(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(Z) 

(z) 

(z) 

(z) 

(z) 

(z) 

( Z ) 

(z 

(z) 

(z) 

(z) 


m.(z)  =  m„ 
1  0 

Polynomial 


m 

m 

m 

m 

m 

m 

m 

m 

m 

m 

m 

m 

m 


0 

1 

3 

5 

7 

9 

11 

13 

15 

21 

23 

27 

31 


(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(z) 

(z) 


Minimal  Irreducible  Polynomials  over  GF(2  ) 


+  nrz1  +  m.z2  +  m.z3  +  m.24  +  mnz5  +  m.z6  + 


1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 


1  0 
0  0 
1  1 
0  1 
1  1 
1  1 
0  1 
1  0 
1  1 
1  0 
0  1 
0  0 
1  0 
1  0 
0  0 
1  1 
0  0 
0  1 
0  0 


0  0  0  0 

10  0  0 

10  0  0 

110  0 

0  111 

1110 
0  10  1 

0  0  0  0 

10  11 

1  I)  0  1 

0  0  11 

0  0  0  1 

0  10  1 

10  10 

0  111 

0  0  10 

1110 
1111 
0  10  0 


Minimal  Irreducible  Polynomials  over  GF(2  ) 


+  m  z  * 


rn  z2  +  m„z3  +  m,z4  +  m  z3  +  nuz6 


1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 


10  0 
1  0  0 

1  1  0 

1  1  0 

0  0  1 

0  1  1 

Oil 
1  0  1 

0  1  0 

1  1  0 

1  0  0 

1  0  1 

0  0  0 


0  0  0 
0  0  1 
1  0  1 
0  1  1 
0  0  1 
0  0  0 
0  1  1 
1  0  1 
1  1  1 
0  0  0 
1  1  1 
0  0  0 
0  1  1 


m 

0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 


Table  III  -  3 


Minimal  Irreducible  Polynomials  over  GF(2“*) 


m^(z)  =  mQ  +  m^zl  +  m^z2  +  m^z3  +  m^z4  +  m^z5 


Polynomial  m^  m,_ 

m  (z)  1  1  0  0  0  0 

m,  (z)  10  10  0  1 

m*  (z)  10  1111 

m^  (z)  1110  11 

mi?  (z)  11110  1 

m  (z)  110  111 

(z)  100101 

4 

Minimal  Irreducible  Polynomials  over  GF(2  ) 


m . (z )  =  m_  +  m  z  *  +  m0z2  +  m_z3  +  m.z4 
i  0  1  2  3  4 


Polynomial  m^  m^  m^  m^  tn 

m  (z)  11000 

m, (z)  11001 

nu(z)  11111 

nv  (z)  11100 

n, J(z)  10  0  11 
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encoder  and  decoder.  It  will  compute  transforms  of  up  to  51  points 
8 

in  GF(2  )  and  the  transforms  in  the  smaller  fields,  as  indicated 
by  asterisks  in  Table  II.  The  hardware  implementation  of  the  bread¬ 
board  is  also  described  in  Volume  II. 

1.4.1  Multi-dimensional  Transform  Partitioning 

It  lias  been  shown  elsewhere  that  a  transform  over  CF(2n*)  of  n 
points  can  be  decomposed  into  a  multi-dimensional  transform  having 
one-dimensional  components  of  transform  lengths  n^,  n9,  .  .  n^ ,  where 
the  individual  lengths  are  a  complete  set  of  relatively  prime  factors 
of  n  [15].  In  that  work,  the  individual  one-dimensional  transforms  were 
computed  by  means  of  the  Winograd  algorithm  for  fast  cyclic  convo¬ 
lution.  The  multi-dimensional  transform  decomposition  can  also  be 
combined  with  our  algorithm  for  fast  polynomial  evaluation  to  produce 
a  fast  transform  algorithm  with  attractive  hardware  implications. 

For  example,  a  255-point  transform  over  C.F(2  )  could  be  configured 
as  a  3-dimensional  transform  of  component  lengths  of  17,  5  and  3 
poinis  respectively.  Equivalently  it  could  be  configured  as  a  2- 
dimensional  transform  of  17  and  15  points,  or  of  85  and  3  points,  or 
of  51  and  5  points.  Consider  the  last  case:  the  full  transform  can 
be  configured  as  5  transforms  of  51  points  each  followed  by  51  trans¬ 
forms  of  5  points  each.  The  51-poini  transforms  would  use  identical 
(or  time-shared)  hardware  but  operate  on  different  input  data, 
selected  as  every  5th  term  of  the  input  sequence  appropriately 
offset.  The  5-point  transforms  are  calculated  in  succession  from 
the  5  outputs  of  the  51 -point  transforms.  This  example  is  of  direct 
interest  to  us  in  the  use  of  our  51-point  transform  breadboard. 

Expansion  to  255  points  using  this  2-dimensional  decomposition  merely 
requires  replication  of  the  51-point  transform  hardware,  followed 
by  a  single  5-point  transformer  used  sequentially.  A  block  diagram 
of  such  a  configuration  is  shown  in  Figure  5.  In  practice,  a  larger 
number  of  5-point  transformers  would  be  used  (perhaps  5)  in  order 
to  maintain  throughput. 
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GF(2  )  Using  Two-Dimensional  Partitioning 


Comparison  of  the  multi-dimensional  approach  with  a  direct  method 
of  transformation,  both  bo-.ed  on  our  algorithm  for  fast  polynomial 
evaluation,  will  show  that  approximately  the  same  amount  of  hardware 
(polynomial  dividers,  field  multipliers,  logic  gates)  is  needed  in 
either  case.  But  in  the  multi-dimensional  case  there  is  greater 
opportunity  for  the  use  of  r  plicated  hardware.  In  the  example  shown 
in  Figure  5  there  are  35  polynomial  division  registers,  but  only  8 
different  types  rather  than  33  different  types  required  for  the  direct 
implementation.  There  are  255  multipliers  in  GF(2m)  required  in  the 
example,  but  only  56  different  types.  In  general  a  multi-dimensional 


transform  of  length  n  = 
to  be  performed  at  only 


i=l 
l  = 


n^  points  requires  polynomial 
k 

5"]  n.  different  roots. 
i=l  1 


evaluation 
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SECTION  IV 


REED-SOLOMON  TRANSFORM  DECODING  ALGORITHM 


Assume  that  a  message  represented  by  the  polynomial  a(z)  is 
encoded  as  a  Reed-Solomon  codeword  by  the  transform,  or  polynomial 
evaluation,  method  described  in  Section  111  and  that  a  number  of 
errors,  within  the  constraint  of  equation  (2),  occurs  in  trans¬ 
mission,  The  received  code  sequence  contains  these  errors.  If  we 
know  which  symbols  are  correctlv  received,  (which  is  the  same  as 
knowing  which  symbols  are  in  error)  we  can  choose  a  correct  subset 
of  k  linearly  independent  equations  from  the  full  set  of  equation 
(3)  and  solve  them  for  the  message  symbols.  An  equivalent  procedure, 
and  the  one  that  is  most  often  used,  is  to  find  the  error  locations 
in  the  received  sequence  and  then  to  use  them  directly  to  solve  for 
the  error  values,  correcting  the  received  code  sequence  from  which 
the  message  can  be  decoded.  Since  there  are  usually  fewer  errors 
than  message  symbols,  the  second  method  should  require  less  compu¬ 
tational  effort.  As  discussed  in  Section  II,  we  will  use  a  transform 
version  of  the  algebraic  decoding  algorithm  to  streamline  the  required 
operations.  But  we  must  first  determine  the  error  syndrome  from 
which  the  error  locator  polynomial  may  be  determined. 


4.1  Error  Svndrome  Computation 


Let  the  source  message  be  represented  by  the  polynomial 
?  k-1 

a(z)  =  +  a^z  +  +  .  .  .  +  a^  ^z  ,  which  we  will  regard  as 

having  degree  n-1  but  with  the  n-k  highest-degree  coefficients 


equal  to  zero.  The  transmitted  codeword  is  represented  by  the  poly¬ 


nomial  A(z) 
efficient  A 


=  A_  +  A, z  +  A„z^  +  .  .  .  +  A  , z°  ^  in  which  the  co- 
01  .  n-I 

is  determined  as  a(b^)  in  accordance  with  the  transform 


of  equation  (4).  If  we  were  to  apply  the  inverse  transform  of  equa¬ 


tion  (6)  to  the  coefficient  sequence  of  A(z) ,  we  would  obtain  the 


message  a(z). 
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Notice  in  particular  that  for  any  valid  A(z)  the  interpolated  message 
coefi icients  a^  must  equal  zero  for  ij-k.  This  construction  is  equiv¬ 
alent  to  choosing  the  set  {b  ^ ,  b  ,  b  b  n+-'-}  as  the 

roots  of  the  polynomial  g(z)  that  generates  the  code  ideal  in  the 
ring  of  polynomials  modulo  zn  -1. 

Assume  that  an  error  sequence  represented  by  the  polynomial 

2  n-l 

E(z)  =  1'  +  E.z  +  !•'  z~  .  .  .  +  E  ,z  has  been  added  to  the  encoded 

01  2  n-l 

message  upon  transr.i  ssion.  If  the  received  word  is  correctable. 

11 

then  E(z)  will  have  no  more  than  — —  non-zero  coefficients;  both 
their  values  and  locations  will  be  unknown.  The  received  sequence 
is  represented  by  the  polynomial  sum  R(z)  =  E(z)  +  A(z)  so  its 
inverse  transform  is  r(z)  =  e(z)  +  a(z) ,  where  e(z)  is  the  inverse 
transform  of  E(z)  and  a(z)  is  the  original  message.  The  de¬ 
coding  problem  is  to  determine  e(z)  in  order  to  extract  the  message 
a(z)  from  the  inverse  transform  r(z)  of  the  observed  sequence  R(z). 

To  compute  the  error  syndrome,  we  first  interpolate  the  poly¬ 
nomial  r(z)  from  the  values  of  the  received  sequence  R(z)  hv  taking 
its  inverse  transform, 

n-l 

r.  =  n_1  7Vb~Jl  (14) 

j=°  -1 

which  is  equivalent  to  multiplying  by  the  constant  factor  n  the 
values  R(b  '“' )  that  result  from  evaluation  of  the  received  sequence 
polynomial  R(z).  Since  b  1  c  bn  1  it  follows  that  the  transform 
structures  described  in  Section  111  can  also  be  used  to  calculate 
the  inverse  transform  ,  provided  that  we  index  the  output  values  in 
reverse  order  and  multiply  them  by  the  scale  factor  n  The  codes 
that  concern  us  here  have  symbols  that  are  binary  m-tuples,  and 
consequently  n  =  2""— 1  =  1  mod  2m  so  that  the  scale  factor  is  unity, 
all  of  which  will  be  assumed  below. 

Since  A(b  X)  =  0  for  i  >_  k  because  of  the  method  of  code  con¬ 
struction,  we  can  separate  from  equation  (14)  a  term  valid  for 
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i=k,  k  +  1,  .  .  . ,  n-1 , 

n-l 

si  =  R  (b  “)  =  ^  ^  Kb  ’ 1 ;  i=k ,  k  +  l,...,n-l.  (15) 

j=0 

The  sequence  {s^}  of  the  n-k  values  calculated  from  equation  (15) 
constitutes  the  error  syndrome  associated  with  the  channel  error 
pattern  F.(z).  This  error  syndrome  is,  by  definition,  equal  exactly 
to  the  last  n-k  values  of  the  inverse  transform  of  the  received 
channel  sequence,  given  by  equation  (14).  The  values  of  equation 
(14)  for  i  <  k  are,  in  general,  different  from  the  values  that  we  would 
obtain  by  extending  the  definition  of  equation  (15). 

4 . 2  Error  Location 

After  the  error  syndrome  has  been  calculated,  it  can  be  used  to 
determine  the  locations  of  the  errors  in  the  channel  error  pattern 
E(z).  Several  procedures  are  available  for  determining  the  error 
locations,  including  a  method  of  continued  fractions  [16] ,  another 
method  that  applies  Euclid's  algorithm  [5],  and  an  algorithm  formu¬ 
lated  by  Berlekamp  and  Massey  [8].  Common  to  all  of  these  methods 
is  the  determination  of  a  polynomial  whose  distinct  roots  designate 
the  error  locations  (or  their  multiplicative  inverses). 

In  our  work,  we  employ  the  Ber lekamp-Massey  algorithm,  judging 
it  to  be  an  available  method  that  is  computationally  efficient  and 
conceptually  satisfactory  from  a  signal  processing  viewpoint. 

It  may  be  regarded  as  an  algorithm  for  synthesizing  the  shortest 
linear  feedback  shift  register  that  generates  a  prescribed 
sequence  obtained  from  the  inverse  transform  (interpolation)  of  the 
received  code  sequence.  The  error  locator  polynomial  is  the  char¬ 
acteristic  polynomial  of  the  LFSR;  its  coefficients  uniquely  satisfy 
a  linear  recursion  with  the  first  n-k  coefficients  of  the  interpolated 
sequence.  In  our  use  of  this  algorithm,  we  include  correction  for 
erasures  by  initializing  the  algorithm  in  accordance  with  the  known 


39 


erasure  locations. 


simply  a 


The  linear  recursion  satisfied  by  the  LFSR  is 
convolution  between  the  input  sequence  and  the  feedback  co~ 


ef f ic  ients . 

A  competing  method,  one  that  may  be  simpler  conceptually  although 
not  as  convenient  for  hardware  imp’ ementat ion  with  linear  sequential 
circuits,  is  based  on  a  continued-fraction  development  of  a  power 
series  expansion  of  the  error  syndrome.  The  continued  fraction  can 
be  used  to  synthesize  directly  a  canonic  ladder  realization  of  an 
equivalent  rational  polynomial  transfer  function,  or  its  values 
can  be  used  to  compute  by  iteration  the  characteristic  polynomial  ot 
the  LFSR.  Both  methods  will  be  described. 

4.2.1  The  Ber lekamp-Massev  Algorithm 

The  Ber lekamp-Massev  algorithm  has  been  discussed  thoroughly  by 
its  authors  [b,  8]  .  The  discussion  presented  here,  which  is  taken 
in  part  from  a  previous  project  document,  is  included  merely  for 
the  sake  of  completeness  and  for  continuity  with  the  transform 
decoding  method  being  described. 

The  channel  error  sequence  K(z)  is  described  by  a  list  of  pairs 
of  field  elements,  Y.  (the  value  of  the  error)  and  X.  (an  error  lo- 
cation  determined  by  the  index  of  a  field  element)  for  the  i  symbol 
error  (7].  The  syndrome  values  may  be  expressed  in  terms  of  these 
elements  as 


v 

R(b~j)  =  s.=  ^Y.X.j;  j=k’  k+1’  •  •  •’  n_1  (16) 

i=l 

(in  accordance  with  the  code  construction  of  Section  III)  where  v  is 
the  'lemming  weight  of  the  error  pattern  or  equivalently  the  number 
of  non-zero  coefficients  of  E(z).  We  assume  v<_(n-k)/2  so  that  the 
error  bound  of  the  code  is  not  exceeded. 

The  algorithm  is  concerned  with  determining  the  coefficients  of 
an  error-locator  polynomial 
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V 

0(Z)  =  JJ  (z-X.)  -  ov  +  +  %-2z2  +  •  •  •  +  (17) 

i=l 

whose  distinct  roots  are  the  error  locations.  The  effect  of  intro¬ 
ducing  the  error-locator  polynomial  is  to  reduce  the  system  of  non¬ 
linear  equations  relating  the  error  values,  locations,  and  syndrome 
components  to  a  system  of  separate  sets  of  linear  equations. 

In  the  BC.H  decoding  algorithm,  a  system  of  equations  is  first  solved 
for  the  error  locations  X^,  which  reduces  equation  (16)  to  a  linear 
system  relating  the  syndrome  values  s.  and  error  values  Y..  The  error 
locator  polynomial  provides  an  intermediate  step  in  the  process  that 
is  useful  for  determining  the  error  locations  . 

For  the  transform  decoding  algorithm,  the  error  locator  polyno¬ 
mial  has  even  greater  signif icance.  First,  a  unique  linear  relation¬ 
ship  can  be  established  between  the  syndrome  values  and  the  coefficients 
of  the  error  locator  polynomial,  namely 


s . 
J 


a 

v 


+  s 


j+1 


+  . 


S  .  .  ,5,  +  S.J  _ 

j+v-1  1  j+v  =  0 


(18) 


which  is  valid  for  all  k  £  j  <  n-l-v.  This  relationship  is  established 
by  multiplying  (both  sides)  of  equation  (17)  by  Y^X^ ,  then  substi- 
tut  'T’g  X^  for  z,  summing  the  result  over  the  index  1  <_  i  <_  v,  and  sub¬ 
stituting  from  equation  (16).  Next,  it  can  be  proven  that  there 
exists  a  polynomial  e(z)  of  degree  less  than  n  that  satisifies 
linear  recursion  (18)  with  o(z)  for  all  1  £  j  n,  and  that  e(z) 
is  uniquely  specified  by  \>  consecutive  values  of  its  transform  E(b  ''), 
(where  for  example  j=k,  k  +  1,  .  .  .,  k+v-1)  and  E(z)  has  no  more  than 

•k 

v  non-zero  coefficients.  In  that  case  E(z)  is  the  channel  error 
pattern,  and  the  calculated  values  E(b  "* )  form  its  inverse  transform. 

"k 

The  proof  is  given  in  Appendix  A  of  reference  [2]. 
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Flit  error  locator  polynomial  can  be  determined  from  the  error  syndrome 
and  then  be  used  to  extrapolate  the  inverse  transform  e(z),  which 
must  be  subtracted  from  the  inverse  transform  r(z)  of  the  received 
channel  sequence  to  extract  the  message.  The  values  E(b  -* )  for  j  >_  k 
exactly  balance  the  syndrome  coefficients  to  produce  zeros  in  those 
message  positions.  There  is  no  need  to  explicitly  find  the  error 
locations  in  order  to  solve  a  set  of  linear  equations  for  the  channel 
error  values,  as  is  usually  done  in  the  BCH  decoding  algorithm. 

The  Ber lekamp-Massey  algorithm  provides  an  iterative  method  of 

synthesizing  the  canonic  linear  feedback  shift  register  that  has  the 

characteristic  polynomial  o(z)  which  is  used  to  extrapolate  the 

inverse  error  transform  e(z).  The  algorithm  is  in  fact  a  constructive 

{r ) 

proof  that  if  the  length  and  connection  polynomial  c  (z)  are 
known  for  the  minimal  length  LFSR  that  generates  the  sequence 
(sk,  sk+1>  •  •  •,  sr)  hut  not  the  sequence  (sk>  sk+1>  .  .  . ,  sr,  sr+1), 
then  a  valid  choice  for  the  connection  polynomial  to  generate  the 
latter  sequence  is 


r-m  (r )  ,  . 
z.  a  (z)  - 


d  d  "1o(m) (z) 
r  m 


(19) 


where  the  next  discrepancy  d 


r 


is  defined  as 


d 

r 


r+k 


sr+k-i 


(20) 


and  the  maximum  degree  of  o^r+^(z),  which  is  also  the  minimum  length 
of  the  shift  register  is 


Jr+1 


max  (L  ,  r+1  -  L^)  ;  d^  4  0 


L  , 
r 


;  d  =  0. 

r 


(21) 
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The  recursion  begins  with  the  initial  conditions  established  for 


r=0,  -1, 


=  0 


a 


(-1) 


=  1 


(0) 


=  1 


(22) 


d-l  =  1 


d0  =  sk. 


The  index  r  corresponds  to  the  rth  step  in  the  recursion  of 

equation  (19);  it  is  also  the  length  of  the  correct  seauence  generated 

by  the  minimal  shift  register  of  length  L  when  d  =0.  The  index  m  is 
C  ui )  it  r 

the  index  of  a  '  (z) ,  the  last  connection  polynomial  before  the  nrevious 
shift  register  length  change.  The  test  2L  <  m  must  be  met  before 
n  ;(z)  is  updated  in  the  recursion.  The  discrepancy  d  is  the  difference 
between  the  desired  next  value  s  ^  and  the  actual  value  computed  by 

the  approximating  shift  register  of  length  L  having  connection  poly- 

( r )  ** 

nomial  o  (z).  The  recursion  must  be  continued  until  the  minimum- 

length  shift  register  that  generates  the  error  syndrome  sequence 

sk’  sk+l’‘'‘’  sk+ >*•**’  sk+2v  'ias  keen  determined,  which  requires 

processing  all  n-k  syndrome  values  to  ensure  proper  termination. 

The  corresponding  shift  register  length  is  equal  to  v,  which  equals 

the  number  of  errors  that  occurred,  and  the  connection  polynomial 

satisfies  equations  (20)  and  (21)  uniquely  [8]. 

After  the  LFSR  has  been  synthesized  by  this  algorithm,  it  is 

necessary  only  to  continue  its  operation,  with  zero  input,  for  an 

additional  k  shifts  in  order  to  extrapolate  the  k  unknown  values  of 

the  inverse  error  transform  e(z)  [2].  These  values  are  subtracted 

from  the  corresponding  values  of  r(z)  in  order  to  decode  the  correct 

message.  This  represents  a  substantial  savings  in  finite-field 

computation  in  comparison  with  the  error-value  computation  of  the 

BCH  decoding  algorithm. 
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In  the  BCH  decoding  algorithm,  after  the  error  locations  have 
been  determined,  it  is  necessary  to  find  the  error  values  in  order  to 
correct  the  received  sequence  and  decode  the  message.  The  conventional 
(non-transform)  application  of  the  BCH  decoding  algorithm  would 
require  us  first  to  find  the  error  locations  by  searching  for  the 
roots  of  the  error  locator  polynomial  and  then  to  calculate  the  error 
values  by  sol.  ing  a  set  of  linear  equations.  Alternatively,  the 
error  value  calculation  can  be  carried  out  by  evaluating  the  residues 
of  a  partial  fraction  expansion  in  the  error  location  singularities 
of  an  error-svndrome  generating  function. 

From  the  transform  viewpoint  such  a  procedure  is  overly  compli¬ 
cated.  Instead,  the  l.FSR  that  is  characterized  by  the  error  locator 
polynomial  is  used  to  extrapolate  the  inverse  transform  of  the  error 
sequence  that  was  added  to  the  transmitted  codeword  sequence.  Since 
the  message  polynomial  is  the  inverse  transform  of  the  channel  code 
sequence,  we  need  only  to  subtract  the  inverse  error  transform  from 
the  interpolated  received  sequence  to  complete  the  decoding,  as  was 
shown  in  Figure  2. 

4.2.2  Modification  For  Correction  Of  Errors  and  F.rasures 

The  decoding  algorithm  described  above  was  concerned  only  with 
correcting  errors,  an  error  being  described  as  a  received  symbol 
of  incorrect  (or  unknown)  value  occurring  at  an  unknown  location. 

An  erasure  is  described  as  an  unknown  symbol  value  occurring  at  a 
known  location;  it  may  occur  for  example  by  observing  the  channel 
noise,  assigning  an  erasure  when  the  detection  decision  becomes 
suf  f  ic  •'  ntly  uncertain.  A  Reed-Solomon  code  can  correct  twice  as 
many  erasures  as  errors;  in  fact  it  cap  correct  any  pattern  of  t 
errors  and  s  erasures  provided  the  inequality  of  eq.  (2)  is  satisfied. 

A  useful  Reed-Solomon  decoder  should  be  capable  of  correcting  both 
errors  and  erasures,  which  requires  some  modi f icatio-1  <o  < he  decoding 
algorithm  used  for  correcting  errors  only. 
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Forney  has  described  a  modified  BCH  decoding  algorithm  for 
correcting  errors  and  erasures  [17].  In  his  method  the  known  error 
locations  are  used  to  linearly  transform  the  error  syndrome  values 
to  a  modified  syndrome  ,  the  latter  being  used  to  solve  for  the 
error  locations.  This  can  be  done,  for  example,  by  using  the  modified 
syndrome  as  input  to  the  Berlekamp-Massey  algorithm  for  determining 
the  error  locator  polynomial,  followed  by  a  root  search.  From  the 
known  errata  locations,  a  set  of  linear  equations  can  be  solved  to 
determine  the  unknown  symbol  values. 

In  our  method  of  correcting  for  errors  and  erasures  we  initial¬ 
ize  the  error  locator  algorithm  (Berlekamp-Massey)  with  the  connec¬ 
tion  polynomial  computed  from  the  known  erasure  locations.  Then, 
we  continue  the  algorithm  normally  to  synthesize  an  errata  locator 
polynomial  which  is  the  product  of  the  error  locator  polynomial  and 
the  erasure  locator  polynomial.  Once  the  errata  locator  polynomial 
is  synthesized,  there  is  no  further  distinction  between  errors  and 
erasures,  and  the  inverse  transform  of  the  errata  pattern  may  be 
extrapolated  by  free-running  the  synthesized  LFSR  as  before. 

The  erasure  locator  polynomial  y(z)  will  be  defined  as 

P 

y(z)  =  T""!-  (z  -  Xt)  =  Yp  +  Yp_1z  +  Yp_2-2  +  •  •  •  +  z°  (23) 
i=l 

where  p  erasures  have  occurred,  not  exceeding  the  minimum-distance 
bound  of  equation  (2).  The  roots  x\  designate  the  erasure  locations, 
forming  a  set  that  is  disjoint  from  the  error  locations  X^.  It  is 
convenient  to  define  an  errata  locator  polynomial  o(z)  as  the  product 
of  the  error  locator  and  erasure  locator  polynomials 

P  v 

p(z)  =  y (z)a(z)  =  J"[  (z-k±)  (z-X  )  (24) 

i=l  j=l 
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a  tool  of  t(z)  designating  either  an  error  or  an  erasure  location, 
in  other  words  an  errata  location. 

The  erasure  locator  polynomial  can  be  calculated  recursively  by 
the  formula 


y('r+1)(z)  =  zyr(c)  -  £r+1vr(z). 


y (0)  =  1. 


(25) 


By  comparison  of  this  expression  with  eq.  (19)  it  is  evident  that 
the  erasure  locator  polynomial  can  be  calculated  by  the  error  locator's 
weight  calculator  if  the  erasure  locations  are  taken  in  sequence 
in  place  of  the  next  discrepancy  values  d^.*  This  initialization 
sets  up  the  erasure  locator  Y(z)  as  the  characteristic  polynomial  of 
the  correlation  register.  During  the  initialization  the  first  p 
syndrome  values,  equal  to  the  number  of  erasures,  are  sequenced  into 
the  correlation  register.  In  the  absence  of  erasures,  the  initial¬ 
ization  reverts  to  the  error-locator  mode. 

After  the  apparatus  has  been  initialized  with  the  erasure 
locator,  the  algorithm  continues  in  the  error  locator  mode  to  gener¬ 
ate  the  errata  locator  polynomial 


Mr+l),  >  r-m'v(r).  ,  ,  ,  -l'v(m)  ,  . 

a  (z)  =  z  o'  (z)  -  d  d  a  (z) 

r  m 


dr  =  Sr+k 


♦E  Vr,w 


i=l 


with  the  initialization  at  r  =  P  given  by 


o(p)(z)  =  Y (z) 


dP  "  Vk  + 


o(m)(z>  =  o(P_1)(z)  =  Y(z) 
YiSP+k-i 


d  =  d.  .  =  1 
m  P-1 


i=l 


*See  Figure  8  of  section  5.1. 


(26) 

(27) 


(26) 
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If  we  write 


s(p) 

a(p-i) 


(z) 

(z) 


=  y(z)cj^ (z) 


_  (z)  (-1)., 

=  Y  a  (z) 


(29) 


and  observe  that  the  errata  locator  recursion  begins  at  r  =  p,  we 
can  factor  the  right  hand  side  of  eq.  (26)  to  obtain 

o(r+1^(z)  =  y(z)  [zr  mo^(z)  -  1o(m)  (z)]  (30) 

=  y(z)o(z) 

and  observe  that  the  term  in  brackets  is  the  recursion  that  calculates 
the  error-locator  tap  weights.  If  the  minimum  distance  bound  of  the 
code  has  not  been  exceeded,  2v  +  p  £  d  -  1,  then  the  remaining 
n-k-o  syndrome  digits  are  sufficient  to  uniquely  determine  the  error 
locator  factor,  a(z),  of  the  errata  locator  polynomial.  This  result, 
as  in  the  case  of  Massey's  algorithm  for  the  error  locator,  can  be 
proved  by  induction.  Extrapolation  of  the  errata  transform  from 
the  synthesized  errata  location  shift  register  implicitly  follows 
from  the  error-extrapolation  proof  in  Appendix  A  of  reference  [2J. 
4.2.3  Continued  Fraction  Algorithm 

Recently  a  method  of  continued  fractions  has  been  advocated  for 
use  as  an  equivalent  to  the  Berlekamp- Massey  algorithm  for  decoding 
Reed-Solomon  codes  [16],  As  part  of  our  work,  we  examined  this 
approach,  but  our  analysis  indicates  slightly  reduced  compu¬ 
tational  complexity  of  the  Ber lekamp-Massey  algorithm,  and  its  con¬ 
venience  of  hardware  implementation,  which  we  continue  to  prefer.  A 
summary  of  our  examination  of  the  continued-fraction  method  is  given 
below. 
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Ir  the  error  syndrome  is  expressed  as  a  power  series 


,  -1,  -1  -2 

s(z  )  =  V  +  w  + 


-(n-k) 

s  ,  z  + 

n-1 


(31) 


it  is  possible  by  successive  polynomial  division  to  exactly  approx¬ 
imate  the  series  by  the  finite  Stleltjes  continued-fraction, 


s(z  1) 


u^z+1 


V1 + 1 


u^z+l 


v,+ 


(32) 


+  1 

u  z+1 

v 


V 

V 


The  process  of  successive  division  terminates  after  2v  steps  where 
v  is  the  weight  of  the  error  pattern.  The  continued-fraction  express¬ 
ion  can  be  used  directly  to  synthesize  the  canonic  ladder  realization 
of  a  digital  filter  network,  as  shown  in  Figure  6.  The  unit  sample 
response  of  this  filter  generates  (periodically)  the  inverse  trans¬ 
form  e (z)  of  the  channel  error  sequence  E(z),  which  accomplishes 
the  same  function  as  free-running  the  LFSR  synthesized  by  the 
Berlekamp  Massey  algorithm.  In  fact,  it  can  be  shown  that  the  com¬ 
panion  matrix  of  the  ladder  network  of  Figure  6  and  the  corresponding 
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.adder  for  Stieltjes  Continued 
m  Realization  (v=3) 


LFSR  have  the  same  characteristic  polynomial  o(z). 

Instead  of  using  the  ladder  network  we  could  also  use  the 
Stieltjes  continued-fraction  convergent  to  form  successive  approxi¬ 
mations  to  the  error  locator  polynomial,  by  the  iterative  set  of 
computations 


a(0) 

c(1) 

o(2> 

a(3> 


(z) 

(z) 

(z) 

(z) 


ulz 

Vla(1)(z)  +  a(0) (z) 
u2a(2)  (z)  +  a(1) (z) 


(33) 


o(2v)(z)  - 


(2v-l),  , 
vvo'  (z) 


a(2-2)(z) 


o(z) 


which  terminates  with  the  last  step  of  the  successive  division.  The 
final  iteration  produces  the  connection  polynomial  of  the  LFSR 
used  to  extrapolate  the  inverse  error  transform. 

In  comparing  the  computational  complexity  of  the  Berlekamp-Massey 
algorithm  with  the  method  of  continued  fractions,  observe  that  both 
methods  in  effect  synthesize  a  recursive  filter  determined  by  the 
error  locator  polynomial.  Once  this  filter  is  synthesized,  the  same 
number  of  computational  steps  are  required  to  extrapolate  the  error 
transform,  so  the  relative  complexity  can  be  compared  for  the  filter 
synthesis  step  of  the  decoding  algorithm.  The  complexity  of  the  two 
methods,  measured  by  the  number  of  finite-field  products  computed  is 
relatively  equal ,  the  Berlekamp-Massey  algorithm  always  providing  the 
slightly  lower  value.  But  there  is  an  important  difference  for  hard¬ 
ware  implementation.  The  Berlekamp-Massey  algorithm  operates  on  the 
syndrome  values  sequentially,  allowing  for  serial  computation  of  the 
syndrome. 
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The  continued-fraction  algorithm  performs  a  polynomial  division  in¬ 
volving  the  error  syndrome  values  at  each  step  of  the  recursion,  thus 
the  syndrome.  computation  must  be  complete  before  the  continued- 
fraction  algorithm  can  proceed.  An  advantage  of  the  continued 
fraction  method  is  that  the  number  of  recursions  never  exceeds  the 
number  of  errors  being  corrected.  For  the  hardware  design  of  our 
Reed-Solomon  decoder  we  prefer  the  serial  computation  cf  the  syndrome. 
4 . 3  Message  Extraction 

Following  synthesis  of  the  error  (or  errata)  locator  polynomial, 
the  inverse  error  transform  is  produced  as  the  unforced  response  of 
the  synthesized  LFSR  and  the  message  is  extracted  by  subtracting  the 
inverse  error  transform  from  the  inverse  transform  of  the  received 
channel  sequence.  Proof  that  the  error  locator  polynomial  may  be 
used  to  generate  the  inverse  error  transform  may  be  found  in  Appendix 
A  of  reference  [2],  The  proof  relies  on  two  lemmas  which  can  be  used 
to  establish  the  uniqueness  of  the  linear  recursion  between  the  in¬ 
verse  error  transform  and  the  error  locator  polynomial,  and  the  suf¬ 
ficiency  of  the  error  syndrome  to  uniquely  define  the  error  locator. 
The  reader  is  referred  to  the  reference  for  the  details  of  the  proof. 
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SECTION  V 


HARDWARE  BESKIN  (SUMMARY) 

It  has  consistently  been  our  goal  in  this  work  to  produce  a 
design,  suitable  for  large-scale  circuit  integration,  that  accommodates 
many  Reed-Solomon  code  parameters  in  order  to  maximize  the  utility 
of  the  hardware.  A  number  of  design-related  tradeoff  studies  were 
performed  to  resolve  the  issues  of  hardware  programmability,  functional 
partitioning,  speed  versus  gate  complexity,  interface  and  control. 

This  work  successfully  culminated  in  the  complete  design,  at  a  basic 
logic  level,  of  a  versatile  decoder  and  in  the  hardware  implementation 
of  a  breadboard,  the  complete  details  of  which  are  reported  separately 
in  Volume  II  of  this  report  [41.  Salient  features  of  our  hardware 
design  are  described  below. 

5 . 1  Functional  Partitioning  and  Programmability 

I'he  two  principal  processing  functions  used  in  our  Reed-Solomon 
decoder,  the  Transformer,  which  is  used  both  for  code  generation  and 
error  syndrome  computation,  and  the  Error  Locator,  which  is  used  both 
to  compute  the  error  locator  polynomial  by  means  of  the  Berlekamp- 
Massey  LFSR  algorithm  and  to  extrapolate  the  inverse  transform  of  the 
channel  error  sequence  for  use  in  extracting  the  corrected  message 
symbols,  are  described  schematically  in  Figure  7  and  Figure  8.  They 
have  both  been  designed  to  incorporate  programmability  to  accommodate 
a  wide  range  of  code  parameters.  The  total  range  of  Reed-Solomon 
codes  that  our  design  can  implement  is  shown  in  Table  IV.  From  this 
table,  t  the  number  of  correctable  errors  is  determined  as  one-half  the 
redundancy,  or  (— ^— ) .  There  are  588  separate  codes  identified  by  the 
code  parameters  (n,k,t)  that  can  be  decoded.  The  subset  of  multiple- 
error  correcting  codes  of  rate  approximately  one-half  that  can  be 
decoded  is  shown  in  Table  V;  there  are  17  of  these. 
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Figure  8.  Error  Locator 


TABLE  IV 


Total 


Table  V:  Half-Rate  Codes  Accommodated 
by  the  Hardware  Design 


(n,k)  code 

Bits/Symbol 

(255,  127) 

8 

(127,  63) 

7 

(85,  42) 

8 

(63,  31) 

6 

(51,  25) 

8 

(31,  15) 

5 

(21,  10) 

6 

(17,  8) 

8 

(15,  7) 

8,4 

(9,  4) 

6 

(7,  3) 

6 

(5,  2) 

8,4 

(3,  1) 

8,6,4 
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Programmability  of  the  hardware  is  achieved  by  stored  program 
control  of  electronically  reconf igurable  feedback  shift  registers, 
transversal  cross-correlators,  and  finite-field  multipliers.  For 
the  transformer  circuit  of  Figure  7,  the  field  elements  required  by 
the  programmable  multiplier-accumulator  that  computes  the  residues 
of  the  divider  outputs  is  generated  by  a  programmable  linear  feedback 
shift  register.  The  polynomial  division  registers,  which  divide  the 
incoming  channel  sequence  by  the  minimal  polynomials  of  the  field 
elements  and  store  the  remainders,  are  also  configured  as  binary 
programma: le  linear  feedback  shift  registers,  both  the  feedback  con¬ 
nections  and  lengths  being  reconf igurable  by  external  program  control. 
Notice  that  the  division  registers  shown  schematically  in  the  figure 
actually  represent  a  set  of  m  identical  binary  feedback  shift  registers, 
each  operating  independently  on  a  component  of  the  m-bit  input  symbol. 

It  is  expected  that  the  entire  set  of  logic  functions  for  the  transformer 
could  be  fabricated  on  a  single  VLSI  (or  VHSIC)  integrated  circuit, 
the  processor  complexity  requiring  about  2000  shift  register  stages 
and  about  9000  logic  gates.  The  program  control  would  most  likely  be 
on  a  separate  control  chip  containing  a  modest  amount  of  programmable 
read-only  memory,  easily  attainable  with  today's  LSI  design  rules. 

Custom  LSI  ’mplementation  of  the  transformer  might  require  several 
chips . 

The  error  locator  circuit,  used  also  to  generate  the  message 
correction  symbols,  has  a  transversal  structure  that  can  be  either 
partitioned  or  fabricated  in  identical  slices  as  indicated  in  the 
dashed  portion  of  Figure  8  for  the  Weight  Calculator  and  Digital  - 
Correlator  sections.  The  digital  correlator  section  can  be  thought 
of  as  a  programmable  transversal  filter  configured  as  a  cross-correlator 
(or  convolver),  the  syndrome  digits  providing  one  set  of  inputs  and  the 
coefficients  of  the  characteristic  polynomial  providing  the  other. 

The  weight  calculator  performs  the  successive  approximations  of  the 


characteristic  (error  locator)  polynomial  and  is  also  used  to  initial¬ 
ize  the  digital  correlator  by  calculating  the  erasure  polynomial  re¬ 
cursively  from  the  known  erasure  locations.  The  remaining  section 
combines  partial  vector  products  produced  by  the  digital  correlator 
section,  performs  the  finite-field  residue-class  reduction  required, 
and  provides  input  and  control  functions  for  the  weight  calculator 
and  digital  correlator  sections. 

The  error  locator  circuit  complexity  required  for  a  rate  one- 
half  code  of  block  length  255  would  require  about  4500  shift  register 
stages  and  about  1400  logic  gates.  Because  of  the  slice  fabrication, 
we  would  expect  this  level  of  complexity  to  be  attainable  on  a  single 
VHSIC  integrated  circuit  chip.  For  custom  LSI  one  might  expect  to 
be  able  to  fabricate  a  16-symbol  section  on  one  substrate;  this  would 
require  8  sections  for  the  same  code,  plus  an  additional  LSI  chip  for 
the  arithmetic  and  logic  sections.  One  each  of  these  custom  LSI 
circuits  would  accommodate  a  Reed-Solomon  (31,15)  code;  an  additional 
small  number  of  custom  LSI  transformer  chips  would  be  required  to 
complete  the  decoder. 

5. 2  Throughput 

For  the  design  described,  if  the  error  locator  uses  digital 
logic  that  operates  at  the  conservative  (medium-scale  logic)  clock 
rate  of  10  MHz;  then  it  takes  about  2  psec  to  correct  a  message  symbol, 
or  2Npsec  to  decode  an  N-symbol  codeword.  For  rate  one-half  codes, 
there  is  no  output  for  about  one-half  the  time  while  the  error  locator 
polynomial  is  being  synthesized.  During  the  remaining  half  of  the 
block  duration,  the  symbols  are  corrected  sequentially.  This  dead¬ 
time  could  be  used  effectively  with  a  pair  of  decoders  processing 
alternate  code  blocks.  For  the  conservative  10  MHz  example,  this  would 
permit  a  constant  message  throughput  of  500K  symbols  (m-bits)  per 
second;  decoding  times  for  rate  one-half  codes  are  shown  in  table  VT . 
The  higher  clock  rates  expected  for  LSI  and  VHSIC  technology 
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Table  VI 


Decoding  Times  for  Half-Rate  Reed-Solomon  Codes 


Code 

Bits/Symbol 

1 - - 

Block  Decoding  Time* 

(255,  127) 

8 

1,020  usee 

(127,  63) 

7 

508  usee 

(85,  42) 

8 

1 

340  usee 

(63,  31) 

6 

252  usee 

(51,  25) 

8 

204  usee 

(3.1,  15) 

5 

124  usee 

(21,  10) 

6 

84  usee 

(17,  8) 

8 

68  usee 

(15,  7) 

8,  4 

60  usee 

* 


Assuming  10  MHz  Logic 
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would  permit  a  proportionately  higher  throughput.  The  medium-scale 
logic  breadboard  designed  and  constructed  under  this  project  will 
decode  a  Reed-Solomon  (31,15)  code  completely  in  124  ysec  with  50% 
efficiency . 
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